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Abstract —For uplink large-scale MIMO systems, linear mini¬ 
mum mean square error (MMSE) signal detection algorithm is 
near-optimal but involves matrix inversion with high complexity. 
In this paper, we propose a low-complexity signal detection 
algorithm based on the successive overrelaxation (SOR) method 
to avoid the complicated matrix inversion. We first prove a 
special property that the MMSE filtering matrix is symmetric 
positive definite for uplink large-scale MIMO systems, which 
is the premise for the SOR method. Then a low-complexity 
iterative signal detection algorithm based on the SOR method 
as well as the convergence proof is proposed. The analysis 
shows that the proposed scheme can reduce the computational 
complexity from 0(K 3 ) to (D(K 2 ), where K is the number 
of users. Finally, we verify through simulation results that the 
proposed algorithm outperforms the recently proposed Neumann 
series approximation algorithm, and achieves the near-optimal 
performance of the classical MMSE algorithm with a small 
number of iterations. 

I. Introduction 

Large-scale multiple-input multiple-output (MIMO) is con¬ 
sidered as a key technology for future wireless communica¬ 
tions JT). Unlike the traditional small-scale MIMO technology 
(e.g., at most 8 antennas in LTE-A), large-scale MIMO ex¬ 
ploits a very large number of antennas (e.g., 128 antennas or 
even more) at the base station (BS) to simultaneously serve 
multiple user equipments (UEs) 0. It has been theoretically 
proved that large-scale MIMO can provide potential opportu¬ 
nity to increase the spectrum and energy efficiency by orders 
of magnitude 0. 

However, some challenging problems have to be solved to 
realize such attractive merits of large-scale MIMO in practice. 
One of them is the practical signal detection algorithm in the 
uplink 0| . The optimal detector is the maximum likelihood 
(ML) detector whose complexity exponentially increases with 
the number of transmit antennas, which makes it impractical 
for large-scale MIMO systems. To achieve the (close) optimal 
ML detection performance with reduced complexity, several 
non-linear signal detection algorithms have been proposed. 
One typical category is based on the sphere decoding (SD) 
algorithm 0, such as the fixed-complexity sphere decoding 
(FSD) algorithm 0. This kind of algorithms uses the un¬ 
derlying lattice structure of the received signal and considers 
the most promising approach to achieve the ML detection 
performance with reduced complexity. It performs well for 
the conventional small-scale MIMO systems, but when the 
dimension of the MIMO systems is large or the modulation 


order is high Q (e.g., 128 antennas at the BS with 64 QAM 
modulation), the complexity is still unaffordable. Another 
category is based on the tabu search (TS) algorithm derived 
from artificial intelligence 0, such as the layered tabu search 
(LTS) algorithm 0. This kind of algorithms utilizes the idea 
of local neighborhood search to estimate the transmitted signal 
and limits the selection of neighborhood by a tabu list. When 
the neighborhood range is appropriately small and the tabu list 
is carefully designed, the complexity is acceptable for large- 
scale MIMO systems, but it suffers from a non-negligible 
performance loss compared to the optimal ML detector. To 
make a trade-off between the performance and complexity, 
one can resort to linear signal detection algorithms, such 
as the zero-forcing (ZF) and minimum mean square error 
(MMSE) algorithms, which are near-optimal for uplink multi¬ 
user large-scale MIMO systems 0j. However, these algorithms 
involve unfavorable inversion of a matrix of large size, whose 
complexity is still high for large-scale MIMO systems. Very 
recently, to reduce the complexity of matrix inversion, m 
proposed the Neumann series approximation algorithm to 
convert the matrix inversion into a series of matrix-vector mul¬ 
tiplications. However, only marginal reduction in complexity 
can be achieved. 

In this paper, we propose a matrix inversion-less signal 
detection algorithm with low complexity based on the SOR 
method DU for large-scale MIMO systems. We first prove 
that the MMSE filtering matrix is symmetric positive definite 
for uplink large-scale MIMO systems, according to which we 
propose to exploit the SOR method to avoid the complicated 
matrix inversion. We also prove the convergence of the pro¬ 
posed signal detection algorithm to guarantee its feasibility 
in practice. We verify through simulation results that the 
proposed algorithm can efficiently solve the matrix inversion 
problem in an iterative procedure until the desired detection 
accuracy is attained. To the best of our knowledge, this work is 
the first one to utilize the SOR method for the signal detection 
in uplink large-scale MIMO systems. 

The rest of the paper is organized as follows. Section HT1 
briefly describes the system model. Section [III] specifies the 
proposed low-complexity signal detection algorithm, together 
with the convergence proof and the complexity analysis. The 
simulation results of the bit error rate (BER) performance 
are provided in Section [IV] Finally, conclusions are drawn 
in Section [V] 


Notation : We use lower-case and upper-case boldface letters 
to denote vectors and matrices, respectively; (-) T , (-) H , (-) -1 , 
and |-| denote the transpose, conjugate transpose, matrix in¬ 
version, and absolute operators, respectively; Re{-} and Im{-} 
denote the real part and imaginary part of a complex number, 
respectively; Finally, represents the N xN identity matrix. 

II. System Model 

We consider a uplink large-scale MIMO system employing 
N antennas at the BS to simultaneously serve K single¬ 
antenna UEs 0,0. Usually we have N » K, e.g., 
N = 128 and K = 16 have been considered in 0 . 

The transmitted bit streams from different users are first 
encoded by the channel encoder and then mapped to sym¬ 
bols by taking values from a modulation alphabet. Let 
s c = [s c ,i, • • •, Sc,if] T denote the transmitted signal vector 
from all K users, and H c £ C :V x K denote the flat Rayleigh 
fading channel matrix, whose entries are assumed to be 
independently and identically distributed (i.i.d.) with zero 
mean and unit variance 0. Then the received signal vector 
y c = [y c ,i, ■ • •, Vc,nY at the BS can be represented as 


III. Low-Complexity Signal Detection For Uplink 
Large-Scale MIMO 

In this section, we first prove a special property of large- 
scale MIMO systems that the MMSE filtering matrix is 
symmetric positive definite. Based on this property, we then 
propose a low-complexity signal detection algorithm utilizing 
the SOR method to iteratively achieve the MMSE estimate 
without matrix inversion. The convergence proof is also ad¬ 
dressed. Finally, we provide the complexity analysis of the 
proposed algorithm to show its advantage over conventional 
schemes. 

A. Matrix inversion-less signal detection utilizing SOR method 

Unlike the conventional (small-scale) MIMO systems with 
small number of antennas, large-scale MIMO systems enjoy a 
special property that the column vectors of the channel matrix 
are asymptotically orthogonal 0- Based on that, we prove 
that the MMSE filtering matrix is symmetric positive definite 
in the following Lemma 1. 

Lemma 1. For uplink large-scale MIMO systems, the MMSE 
filtering matrix W is symmetric positive definite. 


y c =H c s c +n c , (1) 

where n c = [n c ,i, ■ • •, n Cj jv] T is the noise vector whose entries 
are i.i.d and follow the distribution CJ\f(0, o' 1 ). 

For signal detection, the complex-valued system model (1) 
can be converted into a corresponding real-valued one as 


Proof: Since the complex-valued MIMO system model has 
been converted into the real-valued one, the transpose of 
matrix and the conjugate transpose of matrix will be the same, 
e.g., G = H h H = H t H. Thus, we have 

G t = (H t H) t = H t H = G, (6) 


y = Hs + n, 


( 2 ) 


where y = [Re{y c } Im{y c }] T is of size 2N x 1, accord¬ 
ingly s = [Re{s c } Im{s c }] T , n = [Re{n c } Im{n c }] T , 
and 


H = 


Re{H c } 

Im{H c } 


- Im{H c } 
Re{H c } 


2Nx2K 


(3) 


which indicates that the Gram matrix G is symmetric. Mean¬ 
while, for uplink large-scale MIMO systems, the column 
vectors of the real-valued channel matrix H are asymptotically 
orthogonal 0, i.e., the equation Hq = 0 has an unique 
solution, which is the 2K x 1 zero vector. Thus, for any 
2 K x 1 non-zero real-valued vector r, we have 

(Hr) T Hr = r T Gr > 0, (7) 


At the BS, after the channel matrix H has been obtained 
through time-domain and/or frequency-domain training pi¬ 
lots |[T2l H~3ll . the task of signal detection is to recover the 
transmitted signal vector s from the received signal vector 
y. It has been proved that the linear MMSE signal detection 
algorithm is near-optimal for uplink multi-user large-scale 
MIMO systems 0, and the estimate of the transmitted signal 
vector s can be obtained by 

s = (H H H + a 2 I 2Jf )- 1 H ff y = W- 1 y, (4) 

where y = H^y, and the MMSE filtering matrix W is 
denoted as 

W = G + t T 2 I 2K , (5) 

where G = H h H is the Gram matrix. The computational 
complexity of the direct matrix inversion W ' 1 is 0(K : ‘‘), 
which is high for large-scale MIMO systems. 


which implies that G is positive definite. Considering (6) 
and (7), we can conclude that the Gram matrix G =H T H 
is symmetric positive definite. Finally, as the noise variance 
a 2 is positive, the MMSE filtering matrix W = G + c 2 l2 K 
in (5) is symmetric positive definite, too. ■ 

The special property that the MMSE filtering matrix W 
in uplink large-scale MIMO systems is symmetric positive 
definite inspires us to exploit the SOR method to efficiently 
solve (4) with low complexity. The SOR method is used to 
solve iV-dimension linear equation Ax = b, where A is the 
N x N symmetric positive definite matrix, x is the N x 1 
solution vector, and b is the N x 1 measurement vector. 
Unlike the traditional method that directly computes A -1 b 
to obtain x, the SOR method can efficiently solve the linear 
equation in an iterative manner without the complicated matrix 
inversion. Since matrix A is symmetric positive definite, we 
can decompose it into a diagonal component Da, a strictly 
lower triangular component La, and a strictly upper triangular 




component L^. Then the SOR iteration can be described 
as im 

x(‘+D 


— (LaH—Da) 
w 


-l 


1 


w 


(- —1)D a — Li )x« + b 


( 8 ) 


where the superscript i = 0,1,2, • • • denotes the number of 
iterations, and w represents the relaxation parameter, which 
plays an important role in the convergence and the convergence 
rate. Note that when w = 1, the SOR method is the same as 
the well known Gauss-Seidel method CD, which means that 
the Gauss-Seidel method is a special case of the SOR method. 
We will discuss the selection of the relaxation parameter w in 
detail later in Section [TV] 

Due to the MMSE filtering matrix W is symmetric positive 
definite for uplink large-scale MIMO systems as proved in 
Lemma 1, we can also decompose W as 


W = D + L + L t , (9) 


According to the definition of eigenvalue, we have 

Cr= (L + -D)- 1 (-D-D-L r )r = A„r, (14) 
w w 

where r is an arbitrary 2 K x 1 non-zero real-valued vector. 
Note that (14) can be also presented as 

(-D-D-L T )r = (L + -D)A„r. (15) 

w w 

Multiply both sides of (15) by r T will yield 

r T (-D-D-L T )r = A„r T (L + -D)r. (16) 

w w 

Then we take transpose on both sides of (16), and another 

equation can be obtained as 

r T (—D — D — L)r = A n r T (L T + —D)r. (17) 

w w 

Note that D = D r as D is a diagonal matrix. Add (16) and 
(17) will lead to 


where D, L, and L T denote the diagonal component, the 
strictly lower triangular component, and the strictly upper 
triangular component of W, respectively. Then we can utilize 
the SOR method to estimate the transmitted signal vector s as 
below 


s( i+1 ) 


(L+-D)” 1 

w 



1)D-L t ^ s^+y 


( 10 ) 


where s' 0) denotes the initial solution, which is usually set as 
a 2 K x 1 zero vector without loss of generality UTl . Then the 
solution to the signal detection problem (4) can be solved by 
the SOR method according to 

(L+-D)s (i+1) = y + ((- - l)D - L t ) s W. (11) 
w \ w ) 


As (L+iD) is a lower triangular matrix, one can solve the 
equation (11) to obtain s0 +1 ) with low complexity as will be 
addressed in Sectionlllll-C. Next we will prove the convergence 
of the proposed signal detection based on the SOR method. 


B. Convergence proof 


Lemma 2. For uplink large-scale MIMO systems, the signal 
detection algorithm using the SOR method is convergent when 
the relaxation parameter w satisfies 0 < w < 2. 

Proof: We define C = (L + ^D)- 1 (iD - D L T ) and 
d = (L + ^D) -1 y, where C is called as the iteration matrix. 
Then the SOR iteration (11) can be rewritten as 

s (i+1) = Cs (i) + d. (12) 

The spectral radius of the iteration matrix C is defined 
as the non-negative number p(C) = max |A n |, where A„ 

denotes the nth eigenvalue of C. The necessary and sufficient 
conditions for the convergence of (12) is that the spectral 
radius should satisfy ifTTl Theorem 7.2.2] 

p{ C) = max |A n | < 1. (13) 

V l<n<2K 


r T ((— — 2)D — L L t ) r = A„r T (L T +L+—D)r. (18) 
\ w J w 

Substituting (9) into (18), we have 

(l-A„)(--l)r T Dr = (l + A n )r T Wr. (19) 
w 

Since the MMSE filtering matrix W is positive definite as 
proved above, the diagonal matrix D is positive definite, too. 
Then we have r 7 Dr > 0 and r T Wr > 0. Besides, we also 
have (^ 1) > 0 if 0 < re < 2. Thus, we can conclude that 

(1 — A ra )(l + A n ) > 0, which means 

|A„ < 1. (20) 

Substituting (20) into (13), we can assert that p( C) < 1, so 
the SOR iteration (11) is convergent. ■ 

It is worth pointing out that another different proof of 
Lemma 2 can be found in fl4l Theorem 11.2.3], which utilizes 
the orthogonal transformation with high complexity to obtain 
the convergence proof, while our method directly exploits the 
definition of eigenvalue, which is simpler than the existing 
method El. 


C. Computational complexity analysis 

The computational complexity in terms of required number 
of multiplications is analyzed in this part. It can be found from 
(11) that the computational complexity of the ?'th iteration 
of the proposed signal detection algorithm originates from 
solving the linear equation. Considering the definition of D, 
L, and L T , the solution can be presented as 


s 


(i+l) 

m 


(1 - w)s$ 


'(.Vn 


W„ 


E ww4 i+1) 

k<.m 


E W m,kS { ^), ( 21 ) 

k>m 


m, k = 1,2, • • -2 K, 

(i) (i+l) 

where sif , i , and y m denote the TOth element of 
S M, s 6+!), and y in (4), respectively, and denotes 







10 ° 


the mth row and /*:th column entry of W. The required 
number of multiplications in the computation of (1 — w)sm 

and E W m,kSk +1) - E Wm,fc4 S) ) is 1 aIld 

k<m k>m 

2 K + 1, respectively. Therefore the computation of each ele¬ 
ment of s( z+1) requires 2 K + 2 times of multiplications. Since 
there are 2 1\ elements in s" ,+ l \ the overall required number 
of multiplications is 4 K 2 + 1 K. 


TABLE I 

Computational Complexity 



Conventional Neumann series 
approximation algorithm 1101 

Proposed signal detection 
algorithm 

i = 2 

12 K 2 - 41< 

8 K 2 + 8 K 

i = 3 

8 K 3 + 4 K 2 - 2 K 

12 K 2 + 12 K 

i = 4 

16A' 3 - 4 K 2 

16A' 2 + 16A 

i — 5 

24A 3 - 12A 2 + 2 1< 

20A' 2 + 20A 


Table Q] compares the complexity of the conventional Neu¬ 
mann series approximation algorithm GDI and the proposed al¬ 
gorithm based on the SOR method. Since the complexity of the 
classical MMSE algorithm is 0(K 3 ), we can conclude from 
Table U that the conventional Neumann series approximation 
algorithm can reduce the complexity from 0(K 3 ) to 0(K 2 ) 
when the number of iterations is i = 2, but the complexity is 
still 0(K 3 ) when i > 3. To ensure the approximation perfor¬ 
mance, usually a large value of i is required to approach the 
final MMSE solution s as will be verified later in Section [TV] 
So the overall complexity is almost the same as the MMSE 
algorithm, which means only marginal reduction in complexity 
can be achieved. However, we can observe that the complexity 
of the proposed algorithm is (D(K 2 ) for arbitrary number of 
iterations. And even for i — 2, the proposed algorithm enjoys 
a lower complexity than the conventional one CUD- 

Additionally, we can observe from (21) that the computation 
of .Sm +I "* utilizes s^ +1) for k = 1, 2, • • •, m — 1 and s[^ for 
l = m,m + 1, • • 2 K, which is similar to the Gauss-Seidel 

method ED- Then, two another benefits can be expected. 

Firstly, after st +11 has been obtained, we can use it to 

(i) 

overwrite Sm which is useless in the next computation of 
sy’"Ei . Consequently, only one storage vector of size 2 K x 1 
is required; secondly, when i increases, the solution to (11) 
becomes closer to the final MMSE solution s. Thus E, -1 '' 
can exploits the elements of for k = 1, 2, • • •, m — 1 that 
have already been computed in the current iteration to produce 
more reliable result than the conventional algorithm ESI only 
utilizing all the elements of s' 1 ' in the previous iteration. Thus, 
a faster convergence rate can be expected, and the required 
number of iterations to achieve a certain estimation accuracy 
becomes smaller. Based on these two special advantages of the 
SOR method, the overall complexity of the proposed algorithm 
can be reduced further. 

IV. Simulation Results 

To verify the performance of the proposed signal detection 
algorithm, we provide the BER simulation results compared 
with the recently proposed Neumann series approximation 
algorithm (ED. The BER performance of the classical MMSE 
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Fig. 1. BER performance of the proposed SOR-based signal detection 
algorithm against the relaxation parameter w, where SNR = 4 dB and i = 3. 
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Fig. 2. BER performance comparison when N x K = 64 x 8. 

algorithm with complicated but exact matrix inversion is 
included as the benchmark for comparison. Besides, to verify 
the near-optimal performance of the MMSE algorithm, the 
performance of the optimal ML detection algorithm is also 
provided. We consider two large-scale MIMO systems with 
N x K = 64 x 8 and N x K = 128 x 16, respectively. The 
modulation scheme of 64 QAM is adopted. The rate-1/2 in¬ 
dustry standard convolutional code with generator polynomials 
[133 0 171 0 ] is employed, and a random interleaver is also 
used to combat the burst error. The Rayleigh fading channel 
model is considered. After multi-user signal detection, the 
estimated signal vector is used to extract the soft information 
(by calculate the log-likelihood ratios (LLRs)) for soft-input 
Viterbi decoder for channel decoding. 

Fig. 1 shows the BER performance of the proposed SOR- 
based signal detection algorithm against the relaxation param¬ 
eter w, where the signal-to-noise ratio (SNR) is 4 dB, and the 
number of iterations is i = 3. As shown in Fig. 1, BERs of the 
MMSE algorithm are 2.8854 x 10 -3 for N x K = 64 x 8, 
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Fig. 3. BER performance comparison when N X K = 128 x 16. 

and 2.7656 x 10~ 3 for N x K = 128 x 16, respectively, 
which are the targets to be approached by selecting the optimal 
relaxation parameters. We can observe that the BER curve 
against w looks like a parabola, and fortunately the optimal 
w for both systems is 1.05. Furthermore, we have conducted 
intensive simulations of different large-scale MIMO system 
configurations and found that the systems with fixed N/K 
(e.g., N/K = 8 in Fig. 1) will share the same optimal selection 
of w, which indicates that we can easily obtain the optimal w 
after the system dimensions N and K have been fixed. 

The BER performance comparison between the conven¬ 
tional Neumann series approximation algorithm Col and 
the proposed SOR-based signal detection algorithm when 
N x K = 64 x 8 and N x K = 128 x 16 are shown in Fig. 
2 and Fig. 3, respectively, where i denotes the number of 
iterations. It is clear that the BER performance of both 
algorithms improves with the increased number of iterations. 
However, when the same iteration number i is used, the 
proposed algorithm outperforms the conventional one for both 
systems. Moreover, as we can observe from Fig. 2, the BER 
performance of the proposed algorithm when i = 3 is almost 
the same as that of the conventional one when i = 5, which 
indicates that a faster convergence rate can be achieved by 
the proposed SOR-based signal detection algorithm. As we 
have addressed in Section HTI1-C. a faster convergence rate 
means smaller number of iterations is required to achieve a 
certain estimation accuracy, so the complexity of the proposed 
algorithm can be reduced further. 

Meanwhile, we can observe from Fig. 2 and Fig. 3 that 
the MMSE algorithm is near-optimal compared to the optimal 
ME detection algorithm, and the proposed algorithm without 
the complicated matrix inversion can achieve the near-optimal 
BER performance of the MMSE algorithm when the number 
of iterations is large (e.g., i = 3 in Fig. 2 and Fig. 3). 

V. Conclusions 

In this paper, by fully exploiting a special channel prop¬ 
erty of the large-scale MIMO systems, we propose a low- 
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complexity near-optimal signal detection algorithm based on 
the SOR method in the uplink. The SOR-based algorithm 
can iteratively realize the MMSE solution without compli¬ 
cated matrix inversion, which can reduce the complexity from 
0(K 3 ) to 0(K 2 ). We also prove the convergence of the 
proposed algorithm, and simulation results show that it can 
achieve the near-optimal performance of the classical MMSE 
algorithm with a small number of iterations. Moreover, the 
idea of utilizing the SOR method to efficiently realize matrix 
inversion with low complexity can be extended to other signal 
processing problems in wireless communications, such as the 
precoding in the large-scale MIMO systems. 
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